CONFOCAL 3D INSPECTION SYSTEM AND PROCESS 
BACKGROUND OF THE INVENTION 

Technical Field 

The present invention relates to a system, and process for use thereof, for 
inspecting wafers and other semiconductor or microelectronic substrates, and 
specifically for inspecting three dimensional (3D) surfaces or features thereon 
such as bumps. Specifically, the present invention relates to a confocal optical 
system for inspecting bumps and other 3D features on wafers or like substrates, 
and a process of using such system. 

Background Information 

Over the past several decades, the microelectronics and semiconductor 
has exponentially grown in use and popularity. Microelectronics and 
semiconductors have in effect revolutionized society by introducing computers, 
electronic advances, and generally revolutionizing many previously difficult, 
expensive and/or time consuming mechanical processes into simplistic and 
quick electronic processes. This boom has been fueled by an insatiable desire 
by business and individuals for computers and electronics, and more 
particularly, faster, more advanced computers and electronics whether it be on 
an assembly line, on test equipment in a lab, on the personal computer at one's 
desk, or in the home via electronics and toys. 

The manufacturers of microelectronics and semiconductors have made 
vast improvements in end product quality, speed and performance as well as in 
manufacturing process quality, speed and performance. However, there 
continues to be demand for faster, more reliable and higher performing 
semiconductors. 

One process that has evolved over the past decade plus is the 
microelectronic and semiconductor inspection process. The merit in inspecting 
microelectronics and semiconductors throughout the manufacturing process is 
obvious in that bad wafers may be removed at the various steps rather than 
processed to completion only to find out a defect exists either by end inspection 
or by failure during use. In the beginning, wafers and like substrates were 
manually inspected such as by humans using microscopes. As the process has 
evolved, many different systems, devices, apparatus, and methods have been 
developed to automate this process such as the method developed by August 
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Technology and disclosed in U.S. Patent Application No. 09/352,564. Many of 
these automated inspection systems, devices, apparatus, and methods focus on 
two dimensional inspection, that is inspection of wafers or substrates that are 
substantially or mostly planar in nature. 
5 One rapidly growing area in the semiconductor industry is the use of 

bumps or other three dimensional (3D) features that protrude outward from the 
wafer or substrate. The manufacturers, processors, and users of such wafers or 
like substrates having bumps or other three dimensional features or projections 
desire to inspect these wafers or like substrates in the same or similar manner 
10 as the inspection of the two dimensional substrates. However, many obstacles 
u exist as the significant height of bumps or the like causes focusing problems, 

□ shadowing problems, and just general depth perception problems. Many of the 

F! current systems, devices, apparatus, and methods are either completely 

l!j insufficient to handle these problems or cannot satisfy the speed, accuracy, and 

£ 15 other requirements. 

hi 

n\ 

J SUMMARY OF THE INVENTION 

|{ The inspecting of semiconductors or like substrates, and specifically the 

M inspection of three dimensional surfaces or features, such as bumps, is 

It 20 accomplished by the present invention, which is a confocal sensor with a given 
ril depth response functioning using the principle of eliminating out of focus light 

thereby resulting in the sensor producing a signal only when the surface being 
inspected is in a narrow focal range. The result is an accurate height 
determination for a given point or area being inspected such that the cumulation 
25 of a plurality of height determinations from use of the confocal sensor system 
across a large surface allows the user to determine the topography thereof. 

In sum, this system and process creates multiple parallel confocal optical 
paths whereby the out of focus light is eliminated by placing an aperture at a 
plane which is a conjugate focal plane to the surface of the sample. The result 
30 is that the sensor produces a signal only when the sample surface is in a narrow 
focal range. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Preferred embodiment of the invention, illustrative of the best mode in 
35 which applicant has contemplated applying the principles, are set forth in the 
following description and are shown in the drawings and are particularly and 
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distinctly pointed out and set forth in the appended claims. 

Figure 1 is a drawing of one embodiment of the present invention. 
Similar numerals refer to similar parts throughout the drawings. 

5 DESCRIPTION OF THE PREFERRED EMBODIMENT 

The three dimensional (3D) inspection system of the present invention is 
indicated generally at 120 as is best shown overall in Figure 1 and is used in 
one environment to view, inspect, or otherwise optically measure three 
dimensional features or projections on surfaces. One example is the 
10 measurement of bumps on wafers or like substrates. The 3D inspection system 
includes a light source 122, an optical subsystem 124, and a camera 126. The 
optical subsystem includes a beamsplitter 130, an aperture array 132, an object 
reimager 134, and a camera reimager 136. 

The light source 122 is any source of light that provides sufficient light to 
15 illuminate the sample S, and the light source may be positioned in any position 
so long as it provides the necessary light to sample S to be viewed, inspected or 
otherwise optically observed. Examples of the light source include, but are not 
limited to white light sources such as halogen or arc lights, lasers, light emitting 
* diodes (LEDs) including white LEDs or any of the various colored LEDs, 

20 fluorescent lights, or any other type of light source. 

In the preferred embodiment, the light source 122 is an incoherent light 
source, preferably of an incandescent type. In one embodiment, which is most 
preferred and is shown in the Figures, the light source is an incandescent quartz 
style halogen lamp. The light source is of a Kohler design, and specifically a 
25 reimaged Kohler design, which effectively matches the telecentric properties of 
the optical subsystem thereby matching the numerical aperture and field 
properties needed by the system 120 to produce accurate height measurements 
of bumps on the surface of the sample S. 

The Kohler illumination design (1) maps the pupil of light source onto 
30 spatial extension of aperture array, and (2) maps spatial extension of filament in 
light source into numerical aperture or angle space of the reimaging system. 
The reimaged Kohler design differs from a standard Kohler design in two ways: 
(1) reimaged Kohler designs have a filament that is reimaged to a circular 
aperture that very precisely defines a constant numerical aperture over an entire 
35 field, and (2) in between the filament and the sample there is a focal plane that is 
conjugated to the aperture array, and at that focal plane the light is baffled and 
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masked so that light outside of the desired range at the aperture array never 
enters the system. One baffle defines the numerical aperture and another baffle 
limits the light that passes through to only the desired field of view. 

This light source provides sufficient energy to illuminate the sample S and 
5 is typically filtered. The light emitted from the light source 122 is directed into the 
optical subsystem 122. Specifically the light is directed toward beamsplitter 130. 

In more detail and in the embodiment shown in the Figures, the optical 
subsystem 124 includes beamsplitter 130, aperture array 132, object reimager 
134, and camera reimager 136. 
10 Beamsplitter 130 in the embodiment shown is a pellicle beamsplitter. A 

It pellicle beamsplitter has several advantages since it is achromatic, has very low 

n polarization effects, and less variation with angle and color issues, and more 

Ni uniformly provides light even after beam splitting effects than a polarized 

jS beamsplitter. 

fjj 15 Another important feature is the design, setup, alignment and 

m configuration of the light source 122, pellicle beam splitter 130 and the aperture 

p array 132 as is shown in the Figure 1. The light or illumination source 122 

FU provides reflected light to the beamsplitter whereby some of this light passes 

u through the beamsplitter and eminates out of the entire system and is lost, a 

O 20 small amount may be lost within the beamsplitter, and the remaining light is 
ni reflected toward the aperture array. 

The beamsplitter 130 is pellicle and is of a broadband configuration. In 
contrast to a polarizing beamsplitter where incoming light is reflected at 90 
degrees to the path of at least one of the paths of outgoing light such that 
25 incoming and all exiting light are basically near normal incident to the faces of 
the cube, the pellicle beamsplitter in this embodiment overcomes the detrimental 
design limitations of a typical cube beamsplitter of any type including either an 
achromatic or chromatic type. This broadband configuration is necessary 
because in a typical achromatic beamsplitter it is difficult to successfully achieve 
30 very small fresnel reflections on the surfaces unless the beamsplitter includes 
coatings that adopt broad wavelength ranges which are very expensive, very 
sophisticated and difficult to provide. 

Aperture array 132 in the embodiment shown is an opaque pinhole 

array. 

35 The positioning of the aperture array into the system provides a confocal 

response. Only light that passes through an aperture in the aperture array, 
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passes through the dual telocentric object reimager, reflects off of the sample S, 
passes back through the dual telocentric object imager, and passes back 
through an aperture in the aperture array is in focus. This confocal principle 
results in bright illumination of a feature in focus while dim or no illumination of 
5 an out of focus feature. 

The object reimager 134 in the preferred embodiment shown is of a dual 
telocentric design. The object reimager includes a plurality of lenses separated 
by a stop. In one embodiment, the object reimager includes two to six lenses, 
and preferably three to four, on the right side of the reimager and two to six 
10 lenses, and preferably three to four, on the left side of the reimager separated in 
the middle by the stop. Since the reimager is dual telocentric, the stop is located 
Q one group focal length away from the cumulative location of the lenses on each 

ft . . 

}» side. 

J The object reimager functions to: (1) provide a front path for the light or 

? 15 illumination to pass from the aperture array to the object (wafer or sample S), 
[| and (2) provide a back path for the reimaging of the object (wafer or other 

? sample S) to the aperture array 1 32. 

O This system is unique because it is a dual telecentric optical reimager. 

y. This dual telecentric property means that when viewed from both ends the pupil 

tt 20 is at inifinity and that the chief rays across the entire field of view are all parallel 
[it to the optical axis. This provides two major benefits. One benefit which relates 

to the object or sample end of the reimager is that magnification across the field 

remains constant as the objectives focus in and out in relation to the sample. 

The second benefit relates to the aperture end of the reimager where the light 
25 that comes through the aperture array is collected efficiently as the telecentric 

object reimager aligns with the telecentric camera reimager. 
The optical throughput is very high. 

In an alternative embodiment, the numerical aperture of the object 
reimager may be adjustable or changeable by placing a mechanized iris in for 
30 the stop. This would allow for different depth response profile widths. This 
allows for broader ranges of bump or three dimensional measurements since the 
taller the object that it is desirable to measure the lower the desirable numerical 
aperture to maintain speed of the system. Similarly the smaller the object to be 
measured, the more desirable it is to have a higher numerical aperture to 
35 maintain sharpness, i.e., accuracy. 

The camera reimager 136 in the preferred embodiment shown is of a 
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telecentric design. The camera reimager includes a plurality of lenses separated 
by a stop. In one embodiment, the camera reimager includes two to six lenses, 
and preferably three to four, on the right side of the reimager and two to six 
lenses, and preferably three to four, on the left side of the reimager separated in 
the middle by the stop. Since the reimager is telecentric, on the telecentric side 
which is the side nearest the pellicle beamsplitter, the stop is located one group 
focal length away from the cumulative location of the lenses on that side. 

The camera reimager functions to provide a path for the light passing 
through the aperture array from the object reimager to the camera. 

The telecentric properties of the camera reimager are on the aperture 
array side or end so that it efficiently and uniformly across the field of view 
D couples the light coming through the aperture array from the object reimager 

9 134. It is pixel sampling resolution limited so its aberrations are less than that 

from the degradation of the pixel sampling. Its numerical aperture is designed 
based upon the object reimager so any misalignments between the reimagers do 
not translate into a field dependent change in efficiency across the field of view. 

The combined system magnification of the object and camera reiamgers 
is chosen to match spatial resolution at the object to pixel size. 
P In addition, an optional feature in this invention that is used in certain 

embodiments is the canting of either the sample S with reference to the optical 
axis of the entire optical subsystem, or vice versa (that is the canting of the 
entire optical subsystem with respect to the sample S). This option 
compensates for the canting of the aperture array as described above thus 
maintaining the Scheimpflug condition. In the Figure, the canting is shown as a. 

It is also an option not to cant the sample or the optical subsystem when 
the aperture array is canted. In this scenario, some desensitivity of the signal 
occurs but is often not significant or noteworthy. 

The camera 126 may be any line scan camera, area scan camera, 
combination of multiple line scan cameras, time delay integration (TDI) line scan 
camera or other camera or cameras as one of skill in the art would recognize as 
functionally operational herewith. 

In the embodiment shown in the Figures, the camera 126 is a TDI 
camera. TDI provides additional speed by transferring the charge such that the 
system integrates light over time. The aperture array with line scan camera uses 
35 only one array of pinholes while with TDI the aperture array is 100 or more 
arrays by multiple apertures in each line (an example is 100 lines by 1024 
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apertures per line). 

Sampling or viewing may be 1:1 or at another ratio. Where at 1:1, the 
camera operates at a 1 pinhole to 1 pixel ratio. Where undersampling is used, 
the camera is at a ratio other that 1:1 pinholes to pixels, and in one embodiment 
5 is at 1 Vi or 2 pinholes per pixel element at the camera sensor. 

Light passes through the system as follows: Light source 122 illuminates 
and directs such light toward beamsplitter 130. Some of the light that reaches 
the beamsplitter passes through the beamsplitter and eminates out of the entire 
system thus avoiding interference with the system, a small amount is lost within 
10 the beamsplitter, and the remaining light is reflected toward the aperture array. 
\* Light that reaches the aperture array either passes through an aperture therein, 

2 or hits the plate around the holes in the aperture array and is reflected out of the 

SI system due to the cant. Light that passed through the aperture array is 

W reimaged and collimated in the dual telecentric object reimager. The light is 

[is 15 directed toward the sample S and reflects off of the sample S. If the point that is 
CO illuminated is in or near focus, substantially all of the light reflects back into the 

U object reimager while if not in focus then little or none is reflected back. Light 

ru passes back through the object reimager and is directed toward the aperture 

[ =fc array. Light that reaches the aperture array either passes through an aperture 

K 20 therein, or hits the plate around the holes in the aperture array and is reflected 
r6 out of the system due to the cant. Light that passed through the aperture array 

is in focus due to the confocal principle, and it is reimaged and collimated in the 
telecentric camera reimager. It is directed into the camera and the intensity 
recorded. In any given pass, the above process occurs for every point on the 
25 sample that is being viewed. 

The light that passes through the system is received by camera 126 and 
stored. After this process has been repeated at different heights, and across at 
least a portion of the surface, all of the stored data is then processed by a 
computer or the like to calculate or determine the topography of the sample 
30 including the location, size, shape, contour, roughness, and/or metrology of the 
bumps or other three dimensional features thereon. 

In one of the current design and embodiment for bumps or other three 
dimensional features, the process involves two or more (generally three or more) 
passes over the sample surface S each at a different surface target elevation to 
35 measure surface elevation followed by two or more (generally three or more) 
passes each at a different bump target elevations to measure bump elevation 



7 



followed by calculations to determine bump height. The result of the passes is 
an intensity measurement for each point at each elevation where these points as 
to surface elevation and separately as to bump elevation are plotted or fitted to a 
Gaussian or other curve to determine the elevation of both the surface and the 
5 bump from which the actual bump height at a given point is determined. It is the 
difference between the surface elevation and the bump elevation. 

In more detail, a pass is made over a portion or the entire surface of the 
sample S. Intensity is determined for each pixel. Initially, a course or 
approximate surface elevation is used that is approximating the surface location 
10 or elevation of the sample S. The entire sample (or portion it is desired to 
i*h measure) is scanned and the intensities are noted for each pixel, while if very 

O small or no intensity at a given point then the system is significantly out of focus 

sJ at that location or pixel (an example is scanning at the surface elevation where 

W bumps exists results in little or no intensity feedback). This step is generally 

m 15 repeated twice more (though any number of passes may be used so long as a 
in curve can be calculated from the number of passes) at a slightly different 

;, h elevation such as 5, 10 or 20 microns difference in elevation to the first pass. 

f[j The result is three data points of intensity for each pixel to plot or fit a Gaussian 

f =fs or other curve to determine the actual wafer surface elevation at that location. 

n 20 The wafer surface elevation is now known for the entire sample except where 
HI bumps or other significant three dimensional protrusions or valleys exist since 

each of these reported no intensity as they were too out of focus to reflect back 
any light. Curve fitting may be used to determine surface location under the 
bumps. 

25 The second step is to determine the elevation of these significant 

protrusions or valleys (such as bumps). Another pass is made over a portion or 
the entire surface of the sample S (often only where bumps are expected, 
known, or no intensity was found in the surface elevation passes). This pass 
occurs at a course or rough approximation as to the elevation of the expected 

30 bumps such as 50, 100, 200, 300 or the like microns above the surface. 
Intensity is determined at each pixel as the entire sample (or only select 
locations where bumps are expected, known or no intensity was previously 
found) is scanned and the intensities are noted for each pixel, while if very small 
or no intensity at a given point then the system is significantly out of focus at that 

35 location or pixel (an example is scanning at bump elevations where no bump 
exists results in little or no intensity feedback). This step is generally repeated 
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several more times (though any number of passes may be used so long as a 
curve can be calculated from the number of passes) at a slightly different 
elevation such as 5, 10 or 20 microns different. The result is multiple data points 
of intensity for each pixel to plot or fit a Gaussian or other curve to determine the 
bump elevation at that point. 

Once the surface elevations are known and the bump elevations are 
known, the bump heights can be determined. The surface elevations are 
determined for the bump location based upon analysis, plotting, and/or other 
known curve extension techniques of all of the proximate surface elevations 
around the bump. The difference between a bump elevation and the proximate 
surface elevations therearound, or the bump elevation and the calculated 
surface elevation thereunder, equate to the bump height for a given bump. 

In sum, the scanning process for the above invention is as follows: The 
system will scan lines across the sample surface S at a fixed elevation above the 
sample surface S. This scan will generate one z axis elevation on a depth 
response curve for each pixel on the sample under the scan. The sensor will 
then be moved in the z axis direction to a second elevation and the scan will be 
repeated to generate a second z axis elevation on the depth response curve for 
each location on the sample S under the scan. This can then be repeated any 
number of times desired for the interpolation method used (typically at least two 
or three scans, although more are certainly contemplated and will improve 
accuracy). The multiple locations on the depth response curve are then 
interpolated for each pixel to generate a map of the surface height under the 
scan. The elevation of the sample surface S is now known. 

In the case of significant three dimensional protrusions (such as bumps), 
this process may be repeated at the approximate elevation of the outermost 
portion of the protrusions just as it was performed above at the approximate 
elevation of the sample surface S. The bump elevations will then be known, and 
the bump heights are then calculated as the difference between the surface 
elevation and the bump elevation. 

It is important to understand that the size of the "in focus" region is 
determined by the telecentric object reimager. If this lens has a larger numerical 
aperture, the focus range will be small, and conversely if the lens has a low 
numerical aperture the focus range will be large. The best in focus range is 
dependent on the elevation range that needs to be measured. 

The invention also in at least one embodiment is capable of adjusting 
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depth response. This is desirous since with larger bumps a broader depth 
response is desirable while with smaller bumps a thinner or smaller depth 
response is desired. In effect, the system degrades the high numerical aperture 
to look at larger or taller bumps, and this assists in maintaining speed. 
Inversely, to view smaller or thinner bumps it is desirable to provide a higher 
numerical aperture. This broadening of depth response is accomplished either 
by stopping down the aperture, by providing or increasing the tilt of the sensor, 
or by utilizing a different focal length objective lens. 

A significantly different alternative involves imaging multiple heights at 
each point rather than making multiple passes. This is accomplished by using 
multi-line line scan cameras where each camera or sensor is looking at different 
heights. For example, a four line scan camera system would involve line 1 
reading elevation 0, line 2 reading elevation plus 20 microns, line 3 reading 
elevation plus 40 microns, and line 4 reading elevation plus 60 microns. All four 
data points in this example are gathered simultaneously. Alternatively, multiple 
TDI sensors could also be used stacked close together. It is necessary to 
introduce a variable amount of optical path difference between each scan lines 
either by shifting the aperture array or introducing a difference in compensator 
thickness in a media such as glass between the aperture arrays which are in a 
plane and the end of the object imager closest to the aperture array. The result 
is multiple separate planes that are conjugated to separate z heights at the wafer 
or sample surface S. In this case where imaging occurred as to multiple heights 
on a given pass, the surface height calculation and the bump height calculation 
will involve only one pass each. 

In yet another alternative embodiment, two modes of speed are provided. 
A precise mode is provided where scanning occurs as to every die in either or 
both surface elevation determination and bump elevation determination. A faster 
mode is provided where scanning as to wafer surface elevation is performed 
only in one or a few places along the wafer and interpolation is used to calculate 
the surface over the remaining surface including at the die. 

In even yet another embodiment, single pass height determination is 
performed. Specifically, only one pass or scan occurs and gray scale variation 
is used to determine the height of each bump. As a result, only ojne scan is 
used at one z axis elevation, followed by interpolation on a gray scale. 

Accordingly, the invention as described above and understood by one of 
skill in the art is simplified, provides an effective, safe, inexpensive, and efficient 
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device, system and process which achieves all the enumerated objectives, 
provides for eliminating difficulties encountered with prior devices, systems and 
processes, and solves problems and obtains new results in the art. 

In the foregoing description, certain terms have been used for brevity, 
clearness and understanding; but no unnecessary limitations are to be implied 
therefrom beyond the requirement of the prior art, because such terms are used 
for descriptive purposes and are intended to be broadly construed. 

Moreover, the invention's description and illustration is by way of 
example, and the invention's scope is not limited to the exact details shown or 
described. 

Having now described the features, discoveries and principles of the 
invention, the manner in which it is constructed and used, the characteristics of 
the construction, and the advantageous, new and useful results obtained; the 
new and useful structures, devices, elements, arrangements, parts and 
combinations, are set forth in the appended claims. 
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